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Abstract: This paper considers a wide family of semiparametric repeated 
measures regression models, in which the main interest is on estimating 
population-level quantities such as mean, variance, probabilities etc. Examples 
of our framework include generalized linear models for clustered/longitudinal 
data, among many others. Wc derive plug-in kernel-based estimators of the 
population level quantities and derive their asymptotic distribution. An exam- 
ple involving estimation of the survival function of hemoglobin measures in the 
Kenya hemoglobin study data is presented to demonstrate our methodology. 

1. Introduction 

This paper is about semiparametric regression models with repeated measures when 
the primary goal is to estimate a population quantity such as mean, variance, prob- 
ability, etc. We will construct estimators of these quantities which utilize the under- 
lying semiparametric structure of the model and derive their limiting distribution. 

The work is motivated by the following example: the Kenya hemoglobin data. 
The goal is to study the changes of hemoglobin over time during the first year 
of birth. The data set consists of 68 families with 2 children per family. For each 
child, 4 repeated measures are taken over time in the first year since birth: the 
time of visit varied from child to child. The factors include mother's age at child 
birth; child sex and placental parasitemia density (PDEN), a marker of malaria 
that could affect hemoglobin. To model these data, Lin and Carroll [2] considered a 
semiparametric model where the mother's age effect is modeled nonparametrically 
and (sex, PDEN) is modeled parametrically. The model is given by the repeated 
measures partially linear model 

(1-1) Yijk = Xj^kPo + Oo{Zij) + eijk, 

where i = 1, . . . , n, j 1, . . . , m and k — \, . . . , R. Specifically, Lin and Carroll [2] 
set Yijk. = hemoglobin level of the j^^ child in the i*^ family at the fc"^ visit, Zij = 
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mother's age at the j"^ child birth in the i^^ family, Xijk = {sex, logpden, month, 
(month— 4)+}, where sex = 1 if female and if male, logpden = log(PDEN+l), 
month denotes the age of the child at the fc*'^ visit and the function /+ = / if 
/ > and if /< 0. As noted by Lin and Carroll [2], the covariates {month, 
(month— 4)_|_} are used to model the time effect as a picccwise linear function with 
a knot at 4 months: this trend being observed by preliminary analysis of the data 
set. Also assume that, = (e^n, 6^12, . . . , Cimi?) has a Normal(0, S) distribution 
with S = a'^IniR + p<y'^{JmR — ImR), where ImR is the mR x mR identity matrix 
and JmR is a mR x mR. matrix with 1 as all elements. In this context, apart from 
the model components, namely /3o and 9q{u), one is interested in other population 
level quantities such as mean hemoglobin, E{Y) or various probabilities such as 
the proportion of six month old children who have hemoglobin measure above a 
given constant c, pr(Y' > cjmonth = 6), etc. Note that each of the above-mentioned 
population-level quantities can be written as functions oi X, Z and the model 
parameters. For example, because Z ^ sex and logpden are independent of month, 
we can write 

(1.2) pr(y > c|month ^ a) = E ($[{A:'^/?o + Oo{Z) - c}/cr]|month = a) , 

where c and a are given constants and <&(•) is the standard normal cdf. In general, 
we can consider any functional kq = E{J-{»)} for some function this of course 

includes such quantities as population mean, probabilities, etc. 

The Kenya hemoglobin study example is a special case of a much more general 
framework, one we call semiparametric repeated measures regression modeling. Let 
Zi ~ {Zii, . . . , Zim) and let Yi and Xi be the ensemble of responses and other 
covariates, respectively. Consider a semiparametric problem in which the loglike- 
lihood function conditional on {X, Zi, . . . , Z^) is f-{Y, X, 0o{Zi), . . . , 6o{Zm),Bo}, 
where Bq is the set of model parameters. Also as an important generalization, allow 
Y to be partially missing and let (5 = 1 if y is observed and otherwise. Suppose 
further that Y is missing at random, so that pr((5 = Z) = pr((5 = 1\X, Z). 

If we define Cb{») and Cje{') to be the derivatives of the loglikelihood with respect 
to B and 6{Zj), we have the properties that for j = 1, 2, . . . , m, 

E[SCB{Y,X,eo{Zi),...,eaiZm),Bo}\X,Zi...,Z„^ = 0; 
E[S/:je{Y,X,0oiZi),...,eoiZ^).Bo}\X,Zi...,Z„-,] = 0. 

The main goal of this paper is to estimate a population level quantity, namely 

(1.3) Ko = E[T{X, Z,e^{Zi),..., 0o(Z„), fio}] 

for an arbitrary but smooth function For example, in (1.2) we have J-{») = 

'^[{X'^ (3q+9q{Z) — c} / a] where last two components of X are fixed as a and (a— 4)+. 

Although there are various papers about application of semiparametric methods 
in the context of longitudinal data (see for example Zcger and Diggle [5] and Zhang 
et al. [(i]), to the best of our knowledge there is no literature on estimation of 
arbitrary population-level quantities in the repeated-measures context. See Maity 
et al. [3] for the case of no repeated measurement. However, estimation of Bq and 
9q{») is described in Lin and Carroll [2]. If we call their estimates 13 and 6{»,B), 
then we propose to estimate the population-level quantity as 

n 

(1.4) 

/^scmi 

1 ^{Xt, Z^, 0{Z,i,B), . . . , ^(Z„„ B), B}. 

i=l 
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In this paper, we are interested in the asymptotic behavior of Kgemi- 

This paper is organized as follows. In Section 2, we discuss the general semipara- 
metric problem with loglikelihood £(•) and a general goal of estimating kq. We 
also derive the asymptotic distribution of (1.4). In Section 3, we demonstrate our 
method by applying it to analyze the Kenya hemoglobin study data. All technical 
results are given in an Appendix. 

2. Semiparametric models v^rith multiple components 

2.1. Model component estimation 

Estimation of Bq and 9o{») using profile likelihood and backfitting methods is dis- 
cussed in detail in Lin and Carroll [2] when there are no missing data. With miss- 
ing data, the method works as follows. Let K{») be a smooth symmetric density 
function with bounded support and variance 1.0, let h he a bandwidth and let 
Kh{z) = h-^K{z/h). Define Gij{z, h) = {1, {Zij - z)/h}. For any fixed B = 6,, let 
0c(») be the current estimate in an iterative procedure and (So, Si) be the solution 
of the following estimating equation: 

n m 

= ^^5,A-,,(Z„- -z)G,,(z,/i) 
1=1 j=i 

xCje ^Yi,Xi,9ciZii,B*), . . . ,ao -I- ai{Zij - z),.. . ,9c{Zim,B^)^ . 

We then update 9{») as 9{z,Bt,) — Sq. 

To estimate Bq, a backfitting algorithm can be used. In the iterated backfitting 
algorithm, suppose the current estimate is B^. Then the backfitting estimator of Bq 
modified for missing responses is updated by maximizing in B the function 

n 

J2s^^{Y^,X,,9{Z,uB,),...,9{Z,^,B,),B}, 

i=l 

i.e., solving for B 

n 

= ^(5,/:f5{y„x„?(z,i,i30,---,^(^»«.,^*),^}- 

i=i 

The final estimates are obtained by iterating the process until convergence. 

2.2. Estimation of general population-level summaries 

We take advantage of the results about the asymptotic expansions for B and 9{u) 
provided in Lin and Carroll [2] , with the modification of incorporating the missing 
data indicators. Let Cjke{») = dCje{Y, X,9o{Zi), . . . ,9oiZ,n), Bo}/d9{Zk). Make 
the definitions Q{z) — X^jli fj{^)^{^^jje{*)\Zj = z} and 

m m 

A{B,zi,Z2) - ^^/j(zi)i;{<5/:,fee(»)5(^fe,^2)/f^(^fc)|^, -zi}; 

rn m 

Q{zi,Z2) = ^^fjk{zi,Z2)E {SCjke{»)\Zj = zi,Zk ^ Z2} /fl{z2), 
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where fj{z) is the density of Zj and fjk{zi, Z2) is the bivariate density of {Zj, Z^), 
assumed to have bounded support and are positive on the support. Let ^(^1,22) 
be the solution to 

G{Z1,Z2) = Q{zi,Z2) - A{g,zi,Z2). 

Define Cjetsi*) - dC,g{Y, X, OoiZi), 0o(^™), Bo}/dB and 
et{e,B) = CjeB{Y^,X„0{Za),...,9{Z„-n),B} 

71% 

(2.1) + i^iM^ 9{Z,^),B}eB{Z,k, B), 
where 6b{z,Bo) satisfies 

m 

(2.2) = ^/,(z)i?{5e#(0o,Bo)|Z, = z}. 
Define 

m 

ei = Ub{») + YCijg{»)0B{Zij,Bo)\ 
Ml = E{See^) = ~E{5,Cbb{*) + ^ S^CM'WliZj.Bo)}. 

Define the derivatives of JF(») in a similar way to £{•) and make the following 
definitions: 

m 

M2 = E[TB{*) + Y,^ie{')eB{Zj,BQ)]- 

m 
rn 

C2(^) = E[Y,E{T,e{'>)\Z,}g{Z,,z)/n{Z,)]- 

z?(.) = ^/:,e(.){Ci(z,) + C2(z,)}. 

Then we have the following result. 

Theorem 2.1. Assume that (1^, 5^, X^, Zn^ . . . , Zim), i = 1, 2, . . . , n are indepen- 
dently and identically distributed and nh^ 0. Then, to terms of order Op(l), 

n 

(2.3) « n-l/2^{.f:,(.)-Ko + ^^J^^^''5^e^+'5^A(•)} 

4=1 

(2.4) ^ Normal {Q,V^), 

where V«, = £;{J^(») - kq}^ + + E{5D'^{*)}. 
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Note that we have assumed that kq = E[T{X, Z, 9o{Zi), . . . , 9q{Z„i), Bq}] and 
are scalar, which in principle excludes the estimation of quantities such as 
the population variance and standard deviation. However, the result can be readily 
generalized to handle this case. In general, suppose ki = E{J^i{»)} and K2 = 
E{J-2{*)} for some smooth functions J-i{») and J-2{*)^ and are estimated by ki 
and K2, respectively. Suppose that we are interested in Kg^n ~ .9(^*1, ^2) for some 
function g{»,»). Then we have the following result: 

Corollary 2.1. Let Kgon be estimated by Kgen = 9(^1,^2) o-nd that g{»,») is dif- 
ferentiable with respect to both of its arguments. Let and g2{»,») be the 

derivatives of g with respect to its first and second argument, respectively. Then we 
have 

"-^^^(Kgon - «:gon) = n^/^.gi (ki , K2) (ki - Ki) +ri^/^g2('«l,K2)('?2 - K2) + Op(l) 
Normal{0, Vgcn), 

with Vgon = ff?(«;i,K2)Vi +g|(Ki,K2)V2 + 2^1 (ki , ^2)52 , ^2 ) V12 , where Vi and 
V2 are asymptotic variances ofki andK2, respectively, computed using (2.4) and 

Vl2 = E[{Ti{*) - Kl}{.F2(.) - K2}]. 

Remark 2.1. We note that if kq were a function of only Bq, then a bandwidth 
proportional to ft. ~ n~^/^ can be used, because in this case undcrsmoothing is not 
necessary. The reason for the undcrsmoothing in the general case is the inclusion 
of 0o{») in Kq. Using the results from Lin and Carroll [2] we have 

n ni 
n m 

+n-i J2 ^^^^Je{•)g{z, Z,,)/n{z) + Op{n-^/^). 

i=i ]=i 

The restriction n/i^ removes the Op{h^) bias term. It is suggested in Sepa- 
nski et al. [4] that in many semiparametric problems, the optimal bandwidth for 
estimating parameters such as Bo is of the order n"^/'', which of course satisfies 
nh"^ 0. In the case of no repeated measures, i.e., when m = 1, Maity et al. [.■>] 
find that they have good experience numerically by first estimating the bandwidth 
via likelihood crossvalidation, which will be of order n~'^/^, and then multiplying it 
by n~^/^^ to get the bandwidth to be of order n~^/^. 

Remark 2.2. To estimate the asymptotic variance of kq, one can use the bootstrap 
method. While the justification of use of parametric bootstrap for estimation of 
model parameters is provided in Chen et al. [1], we conjecture that the bootstrap 
works for kq as well. One can also use the plug-in method where one replaces each 
term in the variance expression in (2.4) by their consistent estimators. Constructing 
consistent estimators for the first two terms are fairly straightforward where one 
merely replaces all the expectations by sums in that expression and all the regression 
functions by kernel estimates. However, the main difficulty lies in the estimation of 
the third term where one has to solve the functional equation Z2) = Q{zi, Z2) — 
A{g, zi, Z2) for Q{», •), which can be numerically difficult. 

2. 3. General functions of the response 

One interesting and important scenario is when can be constructed using only the 
responses. Suppose that kq = E{Q{Y)}. Also define T{X,Z,9q{Zi), . . . ,9Q{Zjn), 
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Bq} = E{Q{Y)\X , Z}. In the presence of missing data, there are various estimators 
one can use. We will discuss two of these estimators which are based upon different 
constructions for estimating the missing data process. 

Let = T{X,, Zi, e{Zii,B), B),B}. To form the first estimator, 

we impute G(Yi) by for every missing Yi. We estimate kq by 

n 

Til = n"'^{<5^e(?^) + (l-<^^)^^(.)}. 
i=l 

Note that in absence of missing data, ki does not use the semiparametric model 
and is consistent. If the responses are missing at random, then ki is consistent if 
the semiparametric model specification is correct. 

The second estimator depends on the underlying structure of the missing jiata 
process where we assume a parametric formulation for estimating pr((5 = l|y,X, 
Z) = 7r(X, Z, <^), where C is an unknown parameter estimated by standard logistic 
regression of S on {X, Z). Then construct the estimator 



K2 



7^(.X,,Z,,C) I n{X,,Z,,0 



This estimator has the double-robustness property that if cither the semiparametric 
model specification for {do,Bo) or the parametric model for tt{X,Z,() is correct, 
K2 is consistent and has an asymptotic normal distribution. 

If both the models are correct then the following results are true. The proofs are 
given in the Appendix. 

Lemma 2.1. Define 

m 

M2MP - E[{l~6){TBi») + J2Tje{»)eBiZj,Bo)}]; 

m 

j=i 

m 

C2Mp{z) = E[Y,E{{l-S)J^,g{.)\Z,}g{Z„z)/n{Z,)]; 

m 

Amp(») = ^/:je(»){Ciamp(^,)+C2amp(^,)}. 

Assume that nh^ 0. Then, to terms of order OpinT^/'^), 

n 

Hi-Ko « n-i^{<5,a(?«) + (l-<5.,)^,(»)-Ko 
(2.5) +Ml,^pMi'S,e, + (5,;A.imp(»)}. 

Liemma Z.2. Define 7r^(X,Z,C) = dTr{X , Z , Q/dC- Assume that n^/^{C - C) = 
"-~^^^E"=i V'jc(»)+Op(^"^^^) with E{i;t;{»)\X,Z} = 0. Also assume thatnh^ 0. 
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Then, to terms of order Op{n ^^^), 

5r 



K2 — Kq « n ^ ^ 

1=1 



Remark 2.3. The expansions (2.5) and (2.6) show that ki and K2 are asymptot- 
ically normally distributed. One can show that the asymptotic variances are given 
as 

= var|^a(?) + (l- W»)+-^J,imp-^r'^e + '5Amp(») 



V.,o = var 



respectively, from which estimates are readily derived. Note that the estimation 
of Bo and Oq[») has no impact on the asymptotic distribution of K2. Hence it has 
the distinct advantage that its limiting distribution does not involve an integral 
equation. 

3. Application 

3.1. Motivating example 

We applied our method to analyze the Kenya hemoglobin data which is described 
in Section 1. The model is given by (1.1) where n = 68, m = 2 and i? = 4. 
Comparing this to the general model, we see that Yi = (j/in, 2/^24)"'", Xi = 
(Xiii, . . . ,Xi2A)^ and = Z12 = Z^^ = Z,i and Z^^ = Z^e = Z^j = Zm. Also, 
B ~ (/J'^jCr)'^. Let 64 be a vector of ones of length 4. The loglikelihood function, 
conditional of X and (Zi, Z2), is that of a multivariate Gaussian distribution with 
mean A'^/3o + {6'o(Zi)eJ, 0o(-^2)eJ}'^ and covariance matrix S. 

Recall that we are interested in the proportion of all children of a given age a 
who have their hemoglobin measure above any given constant c, which is defined 
as 

(3.1) Ko = E(^mx'^l3o + 6Q{Z)-c]/(j]\uionth^a^. 

Note that the joint density of (Z, sex, logpden) is independent of "month" and hence 
the conditional expectation in (3.1) can be written as £;($[{ A'^/Jq + 6*0 (Z) - c}/a]) 
with the last two components of A being fixed as a and (a — 4)+. Using this fact, 
we estimate /to by 

n ni R 

(3.2) ;?,e„n = {nTOi?}-i^^^$[{AT^^+^(Z,„^)-c}/a], 

i=l j=l k=l 

where the last two components of Xijk are fixed as a and (a — 4)+. Also note that 
there are no missing responses involved in this setup. 

We proceed to estimate Bq and 0o(») as indicated in Lin and Carroll [2] and use 
these estimates to compute (3.2). We estimate Ksomi for a = 3,6 and for different 
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Hemoglobin level (c) 

Fig 1. Results for the Kenya hemoglobin data example. Plotted are estimated survival function 
(1 — the cdf) of hemoglobin measure of children of a given age a, along with 95% confidence 
intervals. Dashed lines correspond to six-month-old children and solid lines correspond to three- 
month-old children. 




0.5 1 1.5 2 2.5 3 3.5 4 4 5 6 7 8 9 10 11 

Age (in months) Age (in months) 



Fig 2. Results for the Kenya hemoglobin data example. Plotted are estimated survival function 
(1 — the cdf) for a fixed hemoglobin level c = 10 as age varies from 0.1 months to 4 months (left 
panel) and 4 months to 11 months (right panel). 

values of c. We estimate the limiting variances via the bootstrap and use those 
to construct 95% confidence intervals. The results arc displayed in Figure f. It 
is interesting to note that the two estimates differ significantly and a decrease in 
proportion is evident as the age of the children increases. We can observe form 
Figure 1 that for a given hemoglobin level, the proportion of children having a 
higher hemoglobin level at age three is significantly less than at age six, indicating 
an increase in risk of getting anemia (low hemoglobin level) as children grow older. 
To observe this effect, we set the hemoglobin level, c = 10 and estimated the survival 
function over a grid of value for age, a. Since the time effect is modeled as a piecewise 
linear function with a knot at 4 months, we plot the estimated survival function in 
two different plots in Figure 2: left panel displays the survival function as age varies 
in [0.1, 4] and right panel plots the survival function when age varies in (4, 11]. The 
decrease in hemoglobin level is much faster for children below age 4 months than 
children older than 4 months, after which the survival function becomes flatter. 
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Note that one can construct other estimators for kq, for example, the sample 
average of hemoglobin measures of only those children who have age a. However, 
it is important to note that the naive estimator does not use the information that 
"month" is independent of (Z, sex, logpden). On the other hand, this vital informa- 
tion is used to construct the semiparametric estimator thus allowing k to be more 
efficient than the naive estimator. Also, note that in this example the use of the 
usual naive estimators may not be justified because the number of children of any 
given age a may be very small. In fact, there are only six children with age a — 3 
and only one child with age a = 6 in the data set we used. This problem docs not 
arise when /?scmi is used because of the fact that it uses all the data to estimate kq 
rather than using only those children who have age a. 

3.2. Simulation study 

We conducted a small simulation study to observe the performance of our method. 
We considered the model given in (1.1) with n = 100 clusters with six measurements 
per cluster. Specifically, we set m = 2 and i? = 3 with Zn = Zi2 = Zi^ and 
Zii ~ Zi^ = ZiQ. We assume that has a Normal(0, S) distribution with S = 
<y^ImR + pcr^iJmR — Itur)- Wc sct (T^ = 1 and p = 0.4. We generated Z from a 
uniform(0, 1) distribution with the true function Oq{z) = sin(8z — 1). The individual 
components of X are generated from uniform(0, 1) distribution, independent of each 
other and Z. The true value of /3o = [1, 1]"""- 
In this setup, our target was to estimate 

Ko(c) = pr(r > c|Xi ^ 0.5) 

= E(^mX^f3o + eo{Z)-c}/a]\Xi = 0.5^ 

for a given value of c, where Xi denotes the first component of X. We estimate 
Ko(c) by 

n m R 

i=l j = l k=l 

where the first component of Xij^. is fixed to be 0.5. Note again that in definition 
of k(c), we used independence between X and Z to our advantage. 

We employed the method in Lin and Carroll [2] with the Epanechnikov kernel 
function to obtain /? and 0{»). Using these estimates we calculated k(c) over a 
grid of values for c. Also the true survival function was computed analytically. We 
generated 1000 data sets, with results displayed in Figure 3. It is evident that the 
estimated survival function given by our method is very close to the true function, as 
one would expect for large sample size. Note that X is generated from a continuous 
distribution and hence the use of naive estimator is not justified in this case. 

4. Discussion 

In this paper, we considered a general class of semiparametric models, with re- 
sponses missing at random, where the primary goal was to estimate population- 
level quantities kq such as the mean, probabilities, etc. For general semiparametric 
regression models, the asymptotic distribution of a plug-in estimator of kq was 
derived. 
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Fig 3. Results for the simulation study based on 1000 simulated data sets. Plotted are the true 
survival function (dashed line), the mean estimated survival function (1 — the cdf) (solid line), 
along with a 95% confidence range (dash- dotted line). 



We have considered the case that the unknown function 9o{Z) was a scalar 
function of a scalar argument. The results though readily extend to the case of a 
multivariate function of a scalar argument. 



Appendix A: Sketch of technical arguments 



A.l. Sketch of Theorem 2.1 



We first show (2.3). First note that £ is a loglikelihood function conditioned on 
{X, Z) , so that we have 

E{5C.jke{*)\X,Z} = -E{SCjeHi:ke{»)\X,Z}; 
(A.l) E{6CjgBi»)\X,Z} = -E{SCje{»)CBi»)\X,Z}. 

Also, it follows from Lin and Carroll [2], and h — Op(n~^/'*) that 

n m 

?(z, Bo) ~ eoiz) = -n-i J2 ^^Kh{Z,^ - z)C,,e{')l^{z) 

i=i j=i 

n m 

(A.2) +n'^ S^^^3e{•)G{z, Z.,,)/n{z) + Op{n-^/^); 

i=\ j=i 

n 

(A.3) B-Bo = M^^n-^YS,e, + Op{n-^^^). 

i=l 
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n 
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m 



i=i 



-1/2 



i=i j=i 



Because n/i'* 0, using (A. 2), we see 



-1/2 



i=i j=i 

n rn 

,-1/2 



1=1 j=l 



k=l 1=1 

n rn 



i=l J=l 



fe=l f=l 



E E ^kCMei»)Ci{Zki) 

k=l 1=1 
n m 

E E '5fc/:Me(«)C2(Zfc,) + Op(l) 

I 

= n-i/2^J,A(-) + 0p(l). 

i=l 

Result (2.3) now follows from (A. 3). 

Next, we show (2.4). Recall that, e = Cb{»)+YIJ=i Cje{»)OB{Zj,Bn) and D[») = 
E"ii 'Cj0(»){Ci(Zj) + C2{Zj)}. We use (2.1), (2.2) and (A.l) to derive that 

£;{5eD(.)} = -Y^E h/:,e6(.) + ^J/:,fe(,(.)%(Zfe,Ho)i 

i=i L I J 

x{Ci(Z,) + C2(Z,)} 



^i?{<5e*(0o,Bo)|Z,}{Ci(Z,) + C2(Z,)} 
j=i 



= 0. 

Also using the facts that E{D[»)\X ^ Z} = and E{e\X,Z) = 0, it is readily seen 
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that 

E[{:F{*) - Ko}D{.)] = 0; 
E[{T{,) - Ko}e] = 0, 

which in turn prove that all three terms in (2.3) are uncorrelated. Then (2.4) follows 
easily from the central limit theorem. 

A. 2. Sketch of Corollary 2.1 

By a Taylor's series expansion, 

Kgcn - Kgcn = {^1 ('«1 : ^2) (/?! - 1 ) + .92 (^1 , K2) (K2 - ^2) } + Op (n" ^/^) . 

Then, the normality follows from central limit theorem. The variance calculation 
is straight forward since (2.3) together with the fact E{D\X,Z) = E{e\X,Z) = 
implies that 

nE{(Ki ~ ki)(k2 - K2)} = E[{Ti{*) - Ki}{T2{») - K2}]. 

A . 3. Sketch of Lemma 2. 1 

We have that 

n 

Ki = n"'^{(5.g(rO + (l-<5.0^^(»)} = ^i+^2, 

1=1 

say. By a Taylor series expansion, 

n 
n 

rn 

+ {^.b{*) + ^^A^WB{Z^,,B„)}^{B - 60) 

i=i 

■m 

i=i 

71 



n 

1=1 



= n 

1=1 



+n-' ^(1 - 6,)T,jei»){0iZ,,,Bo) - 0o{Z^j)} + Op(l). 

i=i j=i 
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Using (A. 2) and nh"^ 0, we see 



1=1 j=i 

n in n m 

i=l j = l k=l 1=1 

n m n m 

i=l j = l k=l 1=1 



^ ^ ^ <^fe^fc€e(»)Cl,imp(-^fcf) 

k=l 1=1 

n m 

^^4'CA:fe(»)C'2,imp(^fc£) + Op(n"^/^) 
k=l 1=1 

n 

'^<5.A.imp(») + Op(n-i/2). 



1=1 



The result now follows directly from (A. 3). 



A. 4- Sketch of Lemma 2.2 



We have that 



K,2 = n ^ ^ 

i=l 



■K{Xi,ZiX) 



= A1 + A2 



say. By a Taylor series expansion, 



t T^iXi.ZiX) 

Q{Y) 



n{X,ZX) 



^ ^ ~. T n 



In addition, 

A2 
Bi 



Bi + B2 + Op{n~^/^); 

6,, 



1=1 ^ 7r(Xi,Zi,C) 



-1 y-' '5.j:r,;(») 



:t {tt{x,,z,x)}' 



-^a^.:^nCr(C-C)+Op(n-l/2). 



136 A. Matty T. V. Apanasovich and R. J. Carroll 

It follows easily that 



i=l 



0,: 



-n 

n 



El- ~ So)(^- So) 

+0p(n-i/2). 

Using = £'{1 J^\X,Z} and B - Bq = Opin'^/'^), we see that the third 

and fourth terms arc o„ 

(n-i/2). Also, using nh'^ and (A. 2) we see 

,-1 .,--1 1^ TT Aj.Zj.C J 



— 1 ' 

n 

1^1 j^i 



n m f r- 
0, 



^EE 



■ I E E - Z,J)CMe{•)/n{Z^J) I 



n m 

--^EEii-^T^h^-( 

i=i i=i 



n m -v 

' E E hCM9{*)Q{Z,j, Zm)/^{Z,j) \ + 
k=i i=i 



The last step follows because E[{1 — }Tj0{»)\Z] = 0. Hence we see that 

Similarly, using C " C = Op{n-^/'^), B - Bq ^ Op{7i-^/^) and 9{z,Bo) - 6'o(z) 
Op{n^^^^), it also follows that 



B. = 



The result now follows by collecting the terms. 
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